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Introduction 

E-journals are undoubtedly less widespread in the Humanities than 
in the Scientific, Technical and Medical (STM) field. This is nof 
jusf because of a lag in Humanities journals' developmenf, buf also 
because of fhe differenf exfenf fo which elecfronic publishing fech- 
nologies fif respecfively fhe differenf nafure and requiremenfs of 
disciplines. Neverfheless, fhe benefifs for fradifional journals com¬ 
ing from elecfronic publishing in ferms of knowledge dissemination 
and capabilify of reaching a wider audience are such evidenf fhaf 
fhey cannof be ignored. Given fhese premises, fhe paper aims fo 
design a new model for e-journals specifically tailored on classicists' 
needs. 

In the first two sections I shall describe and examine the results 
of a survey thaf was carried ouf to assess the degree to which digital 
technologies are used in relation to serial publications in Classics. 
The survey was carried out with the support of fhe foundation "Ri- 
nascimenfo Digifale"^ and involved direcf participation of several 
journal editors. Although only Italian journals were taken into exam- 

^http://www.rinascimento-digitale.it. 
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ination, the results were also consistent with those of related studies 
carried out in other countries (Sachini et al.; Ober et al.; Heath, Jubb, 
and Robey; Jdttkandt). The survey was useful to understand the 
current situation of Classics journals and thus to identify critical 
issues as well as opportunities to enhance the overall value of serial 
electronic publications in the Humanities (Waltham). 

In the third section 1 propose two value-added services identified 
as a result of the survey, which are also intended to bootstrap the 
e-journal as a valuable medium for scholarly communicafion in this 
field. Reference linking and semantic indexing were seen as the services 
that could be provided by e-journal publishers that would also likely 
be considered as useful and desirable by the end users. 

Finally, in the last section I analyse the main technical compo¬ 
nents needed for a working implemenfafion of the e-joumal model 
previously described in terms of its theoretical implications. The 
critical issues at each stage of production are addressed, including 
the management of the journal workflow in a digital environment, 
the automatic production of metadata, and the exposure of contents 
on the Web. 


E-journal Publishing: the State of the Art of 
Classics in Italy 

Survey Goal 

The original purpose of the survey was to investigate the state 
of the art of e-journals in Ifaly. Given the low number of journals, 
however, fhat already have an electronic version whose editors 
actually participated to the survey, collected data about current e- 
joumal features cannot be considered highly representative of the 
current situation. Nonetheless the survey is still illustrative since 
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it illuminates the obstacles and difficulties in providing electronic 
access to journal contents in this field, and if shows fo whaf exfenf 
digifal fools are being used in fhe ediforial workflow. 

As has been nofed above fhe scope of fhis paper is resfricfed fo 
one single nafional sifuahon. However, lef us have a quick look af 
fhe infernafional scenario wifhouf any claim fo comprehensiveness. 
It is not straightforward, indeed, to account for fhe sfate of the art of 
e-journals in Classics as a whole. Alfhough if would be exfremely 
useful, a single and up-fo-dafe resource does nof exisf fhaf lisfs elec- 
fronic journals as well as elecfronic versions of fradifional journals 
and provides informafion abouf the publication policy (subscription- 
based or open access). The section on e-journals in the Cristofori's 
Rassegna Strumenti informatici per I'Antichita Classical was last up¬ 
dated back in 2006. The Directory of Open Access Journals (DOAJ)^ 
does not extensively cover the discipline as was pointed out by M. 
Heath {An author is an animal with two ends). Finally, in this respect 
the Ancient World Online (AWOL)"^ at the time of wrifing confains 
901 enfries. 

All fhis said, fhere are some currenf and pasf examples fhaf are 
worfh menfioning. The BMCR® is an elecfronic journal of reviews 
and purports to be "the second oldest online scholarly journal in the 
humanities".® It is an example of fhe pioneering role fhaf Classics 
played in relafion fo fhe use of new media. Unlike fhe BMCR, ofher 
early examples of born-digifal journals such as Arachnion (Ifaly) ^ 
or Hisfos (UK)® did nof lasf longer fhan four issues, alfhough fheir 
websifes are preserved for fhe hisforical record. Ofher elecfronic 

^http://www.rassegna.unibo.it. 

^http://www.doaj.org. 

^http://ancientworldonline.blogspot.com. 

®http://bmcr.brynmawr.edu. 

^http://bmcr.brynmawr.edu/about.html. 

^http://www.cisi.unito.it/arachne/arachne.html. 

®http:/ /www.dur.ac.uk/Classics/histos. 
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only journals that are still active include Leeds International Classi¬ 
cal Studies'^ (UK) and Plato, the journal of the International Plato 
Societyd° 


Methodology 

As a preliminary step, an analytic sample of journals was selected 
for fhe survey. The Annee Philologique (APh) is fhe infernational ref¬ 
erence bibliographic index for Classics and Philology The Centro 
Italiano dell' Annie Philologique (CIAPh)^^ is fhe nahonal ediforial 
board which reviews 276 serial publicafions published in Ifaly on a 
yearly basis as ifs contribution to the completion of APh's issues. The 
number of serials reviewed by fhe ClAPh, however, also includes 
academic bullefins fhaf cannof sfricfly be considered journals. The 
initial list from the ClAPh was then filtered by using a qualitative in¬ 
dex, the European Research Index for Humanifies European Research 
Index for Humanities (ERIH)A in order fo selecf only fhose fifles fhaf 
are considered most valuable by scholars themselves ERIH basically 
provides a catalogue of journals wifh a classificafion by cafegories 
(i.e. "A", "B" and "C") depending on fhe compliance of a journal 
wifh a sef of qualifafive sfandards esfablished by an infemafional 
experf panel. Whaf was finally obfained is a significanf analyfic 
sample consisfing of 55 journals published in Ifaly and included in 
fhe ERIH index. 

The main areas of interesf covered by fhe survey were: 

1. fhe feafures of fhe elecfronic version of fhe journal (when 
available); 

^http://www.leeds.ac.uk/classics/lics/index.html. 

^®http://gramata.umv-parisl.fr/Plato. 

^^http://www.aristarchus.unige.it/ciaph. 

^^http://www.esf.org/research-areas/humanities/ 
erih-european- reference- index- for- the- humanities .html. 
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2. the journal's copyright policy; 

3. the use of electronic tools to support the editorial workflow; 

4. willingness and obsfacles fo innovafion. 

The quesfionnaire consisted of 31 questions in total: 3 were open 
questions covering the fourth area listed above, while the remain¬ 
ing questions were multiple-choice. The survey was administered 
electronically to the editors of fhe journals in fhe sample during fhe 
month of January 2008.^^ A nofeworfhy defail is fhaf - af fhaf fime 
- just 26 journals out of 55 were present on the Web either with an 
electronic version or at least with an institutional or descriptive web 
page. 


Results 

In terms of parficipation, 18 ouf of fhe 55 invifed journal edifors 
participated in the survey and completed the questionnaire. The 
log analysis revealed that in 18 cases the invitation was just ignored 
(i.e. the invitation link was not followed), in 8 cases fhe parficipanf 
opened if buf never sfarfed fhe complefion process, and in 11 cases 
fhe quesfionnaire was jusf partially completed. 

A first result from fhe survey seems confradictory and in some 
way paradoxical. Of fhe journals fhaf responded fo fhe survey, only 
3 of fhe 18 were available electronically, despite the fact that all 18 
journals support electronic submission of paper manuscripts and use 
commercial software to produce the journal version to be printed. 
Nonetheless electronic versions of journal papers were offered only 
in a few cases. This fact shows that despite the production workflow 

^^For the creation of the electronic forms I used the open source software LimeSur- 
vey, http://www.limesurvey.org. 
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was already partly digital the preferred (and often the only) medium 
for publication sfill remained prinf. 

From fhe answers fo fhe questionnaire secfion on copyrighf pol¬ 
icy it was determined that 9 journals out of 18 allowed their authors 
to publish paper pre-prints on the Web, and in 7 cases out of 18 fhe 
publicafion of a paper posf-prinf was allowed. Wifh regard to the 
publishing workflow, it was discovered that a document containing 
author guidelines was accessible electronically in just 5 cases out of 
18, even fhough in all cases such a documenf was sfafed fo exisf. Fi¬ 
nally, fhe results confirmed that Microsoft Word was accepted by all 
of the journals as a valid format for submission of paper manuscripts. 
The most commonly used Greek font for manuscript submission 
was quite surprisingly SuperGreek (accepted by 14 out 18 journals) 
and not - as one would expect given the number of visualisation 
and interoperability issues it helps to solve - any Unicode-encoded 
font (accepted only in 4 cases). 


Understanding the Field 

The survey let emerge a number of issues that were classified into 
four groups. A firsf group concerns the tension between tradition 
and innovation in relation to both the publication format and the 
editorial tradition. 

In those rare cases where the journal is also published electron¬ 
ically, the electronic version still imitates the printed one leading 
to what was called "digital incunabulum" (Crane et ah). This ten¬ 
dency is also reflected by the software in use and by the data formats 
adopted both for submitting to and publishing the journal. Provided 
that even the Microsoft Word format can be used to encode addi¬ 
tional information about the semantics or the logical structure of a 
document (for example by providing a template document show- 
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ing a consistent use of formatting styles that can then be properly 
transformed into mark-up), this happened in just one case out of 18. 
On the other hand, the PDF format - which substantially replicates 
behaviours and paradigms of the printed page - was found to be 
the most frequently used format for electronic publishing. It is clear 
now, however, that the opportunities of shifting to a fully electronic 
publishing environment could allow publishers to maximise the 
benefits of a digital workflow by producing both a print and an 
electronic version of a journal. 

Another insight that emerged from the analysis of the three open 
questions was that the limited adoption of digital technologies was 
perceived as a factor that discouraged further investment into the 
realisation of e-journals in this field. Participants to the survey 
identified the following main difficulties of going digital: the weight 
of the editorial tradition, the problem of converting old journal 
issues to an electronic format, and the difficulty of establishing e- 
journals as authoritative, particularly those journals that do not have 
a solid tradition in print publishing. 

A second group of issues concerns the tension between preci¬ 
sion and rapidity of publication. The promptness of the publication 
process is not as urgent a need for humanists as it is for those in 
other disciplines. Indeed, the longer shelf-life of publications in the 
Humanities has at least two consequences: far less need for rapidity 
and a much greater degree of concern about the long-term preser¬ 
vation of electronic resources published on the Web. Rapidity is 
therefore simply regarded as a consequence of electronic publishing 
and not as one of its most desirable benefits. Furthermore, when 
editors were asked: "Do you believe that the rapidity of publica¬ 
tion allowed by digital technologies negatively affects the overall 
accuracy of publications?", 5 out of 18 participants answered affir¬ 
matively, reflecting a die-hard and systematic prejudice that exists 


4603-7 



M. Romanello, New Value-Added Services for Electronic Journals in Classics 


among many Classics scholars regarding the benefits coming from 
fhe use of new fechnologies. 

Instead of rapidify with regard to electronic resources, two other 
properties were perceived as crucial: long-term persistency and ci¬ 
tation precision (or the capability of creafing precise links bofh fo 
and within resources, also known as link granularity). An electronic 
resource that does not allow users to precisely refer fo its contents 
is considered useless in this field (Heafh). Indeed, the precision of 
bibliographic and canonical references is offen regarded by classi- 
cisfs as an indicator of fhe overall qualify and scienfific value of a 
publicafion. This is probably fhe main reason why a pagmafed dafa 
formaf (PDF) is sfill largely used fo fhe defrimenf of a more expres¬ 
sive buf un-paginafed one (HTML), since fhe former allows one fo 
use page numbering as citation schema (Fitzpatrick). As a result, the 
aforementioned APh - the reference analyfical bibliographic index 
for Classics - does not provide links to electronic versions of printed 
papers even when they are available, nor does its online version. 

Another tension is the one between broad and long term access. 
Electronic journals were still largely perceived by the respondents 
as "for experts only" resources. As long as they are not reviewed (or 
at least referred fo) by fhe reference bibliographic indexes, if will be 
very difficult to promote the production and use of e-journals in this 
field, and consequenfly fo increase scholars' awareness abouf fhem. 

Along wifh fhe issue of broader access fhere is fhe relafed chal¬ 
lenge of digital preservation, or how effectively to provide long-term 
access to electronic resources. A resource which is widely accessi¬ 
ble but extremely unstable or unreliable is not very useful. Indeed, 
since information ageing in fhe Humanifies, as nofed above, is fyp- 
ically higher fhan in ofher disciplines, fhe need to guarantee the 
accessibility of electronic resources over decades has become urgent. 

^^http: / /www.armee-philologique.com/aph. 
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This issue of long-term access also highlights the need for citations 
to resources to be precise and stable over time as well. Given the 
current lack of guarantees for the long-term persistence of cited elec¬ 
tronic resources, the current citation practice is still (and inevitably) 
predominantly paper-oriented(Dalbello et al.). 

Currently the most suitable solution for journal publishers ap¬ 
pears to be WebCite(R),^® a tool that acts upon user requests by 
caching the cited resource and providing the user with a handy link 
to cite it. The project is part of the International Internet Preser¬ 
vation Consortium and its mission specifically includes long-term 
persistence for links and resources (Eysenbach). It has been adopted 
mainly by publishers in the Medicine field including the authorita¬ 
tive BioMed Central. 

Finally, there appeared to be some problems in conciliating open 
access publishing with a sustainable economic model of journal 
publishing. Several editors mentioned open issues with publishers 
as an obstacle to going digital. These issues were mostly related to 
copyright, such as existing policies that do not allow an electronic 
version to be published. Open access was perceived by editors as a 
highly valuable goal to be pursued but at the same time was one that 
seemed to be hardly sustainable from an economic point of view. 
This perception fully reflects the ongoing debate and research about 
Open Access in the Humanities (Cassella), an issue that cannot be 
fully addressed in this paper. From this debate it also emerges how 
on the one hand Humanities fields have a hidden natural tendency 
to Gold Open Access but on the other hand are still not very far on 
the road towards achieving it. 

A commonly held belief that was also observed among Italian 
editors was that the electronic version of a journal would cause 
economic damage by decreasing the revenues that came from print 

i®http://www.webcitation.org. 
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serial subscriptions. Several case studies (Sachini et al.; lacono et 
al.; Heath, Jubb, and Robey; Jottkandt) have recently demonstrated, 
however, that moving to an e-journal publishing model reduces pro¬ 
duction costs by rationalising and automating certain procedures 
typical of the editorial process. Moreover, the shift to an electronic 
publishing model represents a chance for journal publishers fo be¬ 
come not just content providers but also service providers ("Moving 
out of Oldenbourg's long shadow: whaf is fhe fufure for sociefy 
publishing?"; "Sociefy Publishing, fhe Infernef and Open Access: 
Shiffing Mission-Orienfafion from Confenf Holding fo Certification 
and Navigation Services?"). Once journal confenfs are made openly 
accessible, subscriptions can be offered to institutions or individuals 
to access advanced value-added services, such as those suggested 
in this paper. 


Services that Matter 

This survey clearly illustrated that Classics e-journals are not yet 
widespread and thus their use and growth needs to be bootstrapped. 
Even though some interesting and advanced projects exist (Sachini 
et al.; Ober et al.), they are in the minority. Use and awareness of e- 
journals, open archives and open access in general is sfill nof nearly 
as widespread in fhe Humanifies as if is in fhe STM disciplines, 
fhis facf can be observed also by looking af fhe pracfice of cifing 
elecfronic resources in fhese field (Dalbello ef al.). 

My sfarfing assumption in facing fhis problem is fhaf a greaf 
parf of fhe overall value (and benefif) of e-journals comes from 
fhose provided services fhat are considered useful by fhe final users. 
Indeed, fhe main difference between print and electronic journals lies 
in the layer of services provided to the user. Furthermore, providing 
effective services could allow e-journals to move out of fhe digifal 
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incunabula stage. Another reason to focus on service providing is 
- as noted above - to potentially foster a business model that truly 
makes open access economically sustainable. 

1 focused on the identification of new value-added services, and 
in particular on navigation services, starting from the identified 
specific needs and critical requirements of this field. 1 believe that 
citations play a role of primary importance in the Humanities and 
particularly in Classics, a point that has been previously made by oth¬ 
ers (McCarty; Crane). As a preliminary, there is a need to distinguish 
between two kinds of references used in this field to cite relevant 
resources: 1) bibliographic references to monographs, commentaries, 
journal papers etc. (i.e. secondary sources); and 2) canonical refer¬ 
ences, namely short references that usually follow a logical rather 
than a physical citation scheme and are used in order to cite pas¬ 
sages of ancient texts (i.e. primary sources). Given this distinction, 
canonical references are of primary importance for classicists since 
they refer to the research object itself (i.e. ancient texts). Therefore 1 
propose two services that leverage different aspects of citations: 1) 
citations as navigation tool both through and between resources; 2) 
citations as access point to information and as index to resources. In 
the following subsections the main characteristics and implications 
for the proposed services are discussed. 

Reference Linking 

Reference linking is essentially the capability in a digital envi¬ 
ronment to move directly from a citation to the resource to which it 
refers. Reference linking for secondary sources (i.e. modem biblio¬ 
graphic resources) is a feafure already provided by several library 
catalogues, specialised search engines and e-journals. Once an inter¬ 
esting bibliographic reference has been found, a user can sometimes 
directly access it if it is referred to from an electronic resource, or 
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they can check for the availability of an elecfronic copy af fheir own 
insfitution's library in fhe case of a print resource. From a more 
technical perspective, the DOI technology, the CrossRef initiative 
and the OpenURL standard protocol have all contributed in the last 
few years fo fhe solid fechnical foundafion upon which if is now 
possible fo build a range of reference linking services for secondary 
sources (de Sompel and Beif-Arie). 

Reference linking to Classical primary sources (i.e. ancient texts), 
however, remains a desirable but not yet available service. Digital 
Humanities researches as a whole are an attempt to address specific 
scholarly users' needs with new digital tools. 1 believe that browsing 
the links between texts remains for classicisfs a crucial feafure fo be 
provided and as such this problem needs to be addressed by research 
in this field. When looking af fhe currenf sfafe of digifal libraries, if 
is clear how fhere is a paucity of links befween ancienf and modern 
resources. For insfance, editions of ancienf fexfs available fhrough 
Perseus, Google Books and the Internet Archive are not yet linked in 
any way to pertinent research papers stored in the Internet Archive 
itself or wifhin JSTOR. 

For a classicisf if would neverfheless be useful when reading a 
passage by Homer or Aeschylus fo have af hand a lisf of references 
fo resources that discussed topics concerning that passage. Or in 
turn, while reading a research paper or a commentary a scholar 
might want to read the full fexf of a cifed passage from an ancienf 
work. 

Recenfly, some work has been done fo address fhis lack of sfan- 
dards and fools in order fo provide such a reference linking abilify 
for primary sources. Bofh fhe solufions fhaf were proposed up to 
now leverage a network protocol for the implementation, respec¬ 
tively OpenURL and the Canonical Text Services (CTS) protocol. The 
first solution is an OpenURL-based system to link canonical works 
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in the Humanities which was identified to enhance the search inter¬ 
face of fhe Annee Philologique online. Insfead, fhe second solution 
leverages a more discipline-specific protocol to achieve the same 
functionality. 

Indeed, the CTS protocoh^ (Smith) can rightly be considered 
the counterpart of OpenURL for Classical primary sources. This 
protocol basically provides a digital equivalent to print canonical 
references by making texf repositories accessible through logical 
citation schemes. Texts within a CTS repository are accessible, and 
thus citable, by logical hierarchical levels such as book, section 
paragraph or poetic line. Other recent work has also examined 
the technical requirements necessary to implement this feature of 
reference linking by leveraging the CTS protocol and techniques to 
embed citation metadata within (X)HTML documents (Romanello, A 
Semantic Linking System for Canonical References to Electronic Corpora), 
as well as presenting an e-journal prototype where this feature is 
implemented ("A semantic linking framework to provide critical 
value-added services for E-joumals on classics"). 

Semantic Indexing 

The second feafure I propose, semanfic indexing, can be consid¬ 
ered as a pracfical consequence of fhe firsf one. Once references are 
encoded for the purpose of reference linking, it becomes straight¬ 
forward to leverage the available pieces of semantic information 
found within citations in order to index the resources themselves. 
In order to allow operations such as crawling and harvesting of web 
resources it is necessary to envisage a solution that embeds mean¬ 
ingful mefadafa abouf cifafions wifhin web resources. Indexing fhe 
references to Classical primary sources found within documents 

i^http://ctsS.sourceforge.net. 
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provides users with an additional and significant access point to in¬ 
formation. Indeed, while traditional search engines usually perform 
sfring mafching searches over documenfs, a semantic indexing sys- 
fem undersfands fhe semanfics of cifafions no maffer whaf sfrings 
are used fo express fhem. In ofher words, semantic indexing allows 
users fo access Classics e-joumal confenfs as if fhey were using ci¬ 
fafions fo ancienf aufhors and works as grouping keys. In ofher 
disciplines, such as chemisfry or genome research, references fo 
chemical elemenfs and compounds are normally exfracfed, classi¬ 
fied and fhen employed fo enhance user access fo confenfs fhrough 
fhe addition of semanfic awareness. 


Implementing an Innovative Classics 
E-journal 


In fhis section fhe problem of how fo devise a scalable imple¬ 
mentation of such a Classics e-journal is examined, with a particular 
focus on fhe new proposed services, fhaf is reference linking and 
semanfic indexing. The whole publishing process is faken info 
considerafion, from fhe editorial workflow managemenf fo fhe pre- 
senfafion and access of research confenfs on fhe Web. Indeed fhose 
aspecfs represenf fhe main issues fhaf are likely fo be faced by a 
fradifional prinf journal on fhe path of fransifion fo an elecfronic 
publishing model. 
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Choosing a Platform to Manage the Editorial 
Workflow 

As of this writing, several open source solutions exist in terms 
of e-journal publishing platform.^^ Open Journal Systems (OJS) (Will- 
insky), however, is the best fit in terms of the previously listed needs 
and technical requirements for a Classics e-journal. Firstly, OJS 
provides a plug-in mechanism that allows developers to extend its 
functionalities. In our case, this architectural detail enable us to 
implement new value-added services tailored to specific user needs 
as components that could be plugged into OJS' architecture. 

Secondly, OJS supports batch operations (configurable through 
XML files) to import old journal issues. As both the case studies 
and our survey demonstrated, the ability to easily convert print 
journal back-runs into a digital format constituted a preliminary 
step of essential importance for a journal migrating to an electronic 
platform. 

Moreover, the survey illustrated that the most frequently used 
format for journal submissions in the field of Classics is Microsoft 
Word, a proprietary data format. While allowing for the fact that 
our data refers only to Italian journals, it can be considered as a 
reasonable indicator of tendencies in the whole field. This situation 
compels us to find a trade-off solution between the optimum (XML 
structured encoding) and the current practice (the use of Microsoft 
Word). Indeed, in order to provide users with advanced services it 
is necessary to have some kind of structured information available. 

The OJS framework provides Lemon8-XML (L8X) for this pur¬ 
pose, a tool that converts word processor file formats into XML 
mark-up compliant with the Journal Publishing Tag Set defined by 

^^http: / / www.openoasis.org/index.php?option=com_content&view= 
article&id=353&Itemid=379. 
i^http://www.lemonS.org. 
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the US National Library of Medicine. To reach this goal, OJS applies 
a set of heurisfics fo defecf and mark up logical sections of journal 
papers, such as title, abstract, sections, and references among ofhers. 

Encoding References (Semi-)Automatically 

Since mefadafa producfion is bofh a time consuming and expen¬ 
sive task, providing value-added services in a scalable way means 
finding automatic procedures that can help with its creation. The 
metadata to be extracted from journal articles are canonical and 
modem bibliographic references since they are needed to provide 
semantic indexing and reference linking to both primary and sec¬ 
ondary sources. 

L8X, which is part of the OJS framework, supports integration 
with multiple citation parsers such as Paracite,^® Freecite^® and 
ParsCiU^ that aim to detect bibliographic references within scholarly 
papers (Suhonos). The output obtained from rurming fhose parsers 
is then combined with search results from freely-available online 
indexes in order fo supporf manual correction of automatically de¬ 
tected metadata through a graphical user interface.^^ Provided 
that the above tools are mostly used to parse references wifhin arfi- 
cles from the STM field, it is noteworthy that ParsCit was recently 
provided with some initial Humanities training datasets. Indeed, 
ParsCit employs a machine-learning approach whose main feature 
is the possibility of framing fhe parser on dafa from differenf disci¬ 
plines. As far as concerns the automatic extraction of canonical refer¬ 
ences from scholarly articles, fhis feafure is nof currently supported 
by any journal publishing platform. Its technical feasibility, however, 

i^http: / /paracite.eprints.org. 

^®http: / /freecite.library.brown.edu. 

^^http: / /aye.comp.nus.edu.sg/parsCit. 

^^http: / /pkp.sfu.ca/18x/Lemon8-Architecture.pdf. 
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was demonstrated by recent work (Romanello et al.) that is leading 
to the implementation of a Canonical References Extractor (CRefExf-^ 
based on a machine-learning approach. Such an extraction engine 
can be eventually integrated as a plug-in into the O/S architecture. 

Enriching Web Resources with metadata Embedding 

Embedding metadata within Web pages has at least two major 
practical results, first, it makes metadata discoverable by both web 
crawlers and search engines, and, second, it can be reused by scripts 
or applications running on the client side. In the case of fhe fwo 
services fhaf I am proposing, fhe mefadafa fhaf need to be embed¬ 
ded are those related to the modern and canonical bibliographic 
references fhaf are confained in a given article. Moreover, fhe main 
search engines such as Yahoo! and Google have recently started 
crawling semantic information embedded within web pages in the 
form of eifher Microformats and RDFa. Microformats (Allsopp) and 
RDFa (Velez) are currently the most suitable and widely adopted 
techniques used to embed metadata within Web resources. 

Some recent work conducted within the Microformats commu¬ 
nity boosted general interest about semantic metadata embedding 
techniques, many of which are now regarded as a pracfical way 
fo realise fhe Semanfic Web. Low inferesf was expressed by the 
Microformats commrmity, however, regarding the development of 
non-commercial Microformats. The main difference between the two 
technologies is that while RDFa applies to any set of RDF-encoded 
dafa. Microformats are defined and need fo be approved by a de¬ 
velopers communify. Therefore, RDFa seems fo be a sfandard more 
complianf wifh fhe decenfralised nafure of fhe Web ifself (Graf). 

CoinS^^ is a widely adopfed convention used for publishing bib- 

https:/ / github .com / mromanello / CRefEx. 

^^OpenURL ContextObject in SPAN, http://ocoins.info. 
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liographic references in HTML using the OpenURL protocol and is 
very similar to Microformats even though it was not formally ap¬ 
proved by the Microformats community. Zotero^® is one of the most 
important applications that leverages CoinS-encoded references and 
it allows users to import them into their libraries. As far as con¬ 
cerns metadata about canonical references, the use of Microformats 
has been suggested by initial studies (Romanello, A Semantic Link¬ 
ing System for Canonical References to Electronic Corpora-, "A semantic 
linking framework to provide critical value-added services for E- 
joumals on classics"). However, the best solution is believed now 
to be employing RDFa to embed semantic information specified by 
means of an ontology. The described metadata embedding mecha¬ 
nism can be added to OJS as an export filtering option to produce 
a (X)HTML-i-RDFa output, or by extending its HTML rendering 
mechanism. 


Conclusions 

In this paper I attempted to draw attention on services that were 
observed to be desirable for users of the Classics field, and more 
generally on the importance of devising value-added services that 
actually fit scholars' needs. The hope for the immediate future is 
that Classics e-journals will move out of the incunabular stage by 
providing the users with features tailored to their needs. 


^®http: / /www.zotero.org. 
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